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Abstract 

Modularity is one of the most widely used quality measures 
for graph clusterings. Maximizing modularity is NP-hard, 
and the runtime of exact algorithms is prohibitive for large 
graphs. A simple and effective class of heuristics coarsens the 
graph by iteratively merging clusters (starting from single- 
tons), and optionally refines the resulting clustering by iter- 
atively moving individual vertices between clusters. Several 
heuristics of this type have been proposed in the literature, 
but little is known about their relative performance. 

This paper experimentally compares existing and new 
coarsening- and refinement-based heuristics with respect 
to their effectiveness (achieved modularity) and efficiency 
(runtime). Concerning coarsening, it turns out that the 
most widely used criterion for merging clusters (modularity 
increase) is outperformed by other simple criteria, and that a 
recent algorithm by Schuetz and Caflisch is no improvement 
over simple greedy coarsening for these criteria. Concerning 
refinement, a new multi-level algorithm is shown to produce 
significantly better clusterings than conventional single-level 
algorithms. A comparison with published benchmark results 
and algorithm implementations shows that combinations of 
coarsening and multi-level refinement are competitive with 
the best algorithms in tlie literature. 



1 Introduction 

A graph clustering partitions the vertex set of a graph 
into disjoint subsets called clusters. Modularity was in- 
troduced by Newman and Girvan as formalization of the 
common requirement that the connections within graph 
clusters should be dense, and the connections between 
different graph clusters should be sparse [50] ■ It is by 
far not the only quality measure for graph clusterings 
[121 |3S] , but one of the most widely used measures, and 
has been successfully applied for detecting meaningful 
groups in a wide variety of complex systems. 

The problem of finding a clustering with maximum 
modularity for a given graph is NP-hard [B], and even 
recent exact algorithms scale only to graphs with a few 
hundred vertices [3 [H HO] . In practice, modularity is 
almost exclusively optimized with heuristic algorithms. 
Like modularity itself, many of these heuristics have 
been proposed in the physics literature. 

A particularly simple heuristic is the iterative merg- 
ing of cluster pairs, starting from singleton clusters, and 
always choosing the merge that results in the largest 
modularity increase. This greedy coarsening can be ef- 



ficiently implemented and produces reasonable cluster- 
ings ^27j llj, but was soon observed to be biased to- 
wards merging large clusters [S] [35] • To remove this bias 
and obtain clusterings with even higher modularity, re- 
searchers suggested to replace modularity increase with 
other prioritizing criteria for potential merges [H [38] . 
and to modify the purely greedy merge strategy |36j . 
However, the numerous proposals have not been or- 
ganized into a coherent design space, and the pub- 
lished evaluation results are largely incomparable due to 
the use of different (and often small) graph collections. 
Therefore, Section [3] systematically describes major de- 
sign alternatives for coarsening algorithms, including 
two new prioritizing criteria for merges, and Section [5. 2 1 
compares them experimentally. 

The clusterings produced by coarsening heuristics 
can be improved with refinement algorithms, which it- 
eratively move individual vertices between clusters |36j . 
An obvious solution is greedy refinement, which always 
chooses the vertex move resulting in the largest increase 
of modularity. However, moving the vertices in arbi- 
trary order (instead of always moving the best vertex) is 
much faster and not necessarily less effective, and adap- 
tations of the classic Kernighan-Lin refinement (5^ are 
not much slower and have some capability to escape 
local maxima. All of these algorithms can be applied 
not only to the original graph, but to any level of the 
coarsening hierarchy, by considering each cluster of the 
coarsening level as a single coarse vertex. This multi- 
level refinement is extremely effective for minimum cut 
partitioning problems [18! T^, but has not previously 
been adapted to modularity clustering. Section [4] de- 
tails the single-level and multi-level refinement heuris- 
tics, and Section fS . 3| compares them experimentally. Be- 
cause the effectiveness of (particularly multi-level) re- 
finement may depend on the coarsening algorithm. Sec- 
tion 5.4 examines various combinations of coarsening 
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and refinement heuristics. 

Section [6] compares public implementations and 
benchmark results of modularity clustering heuristics, 
without a restriction to coarsening and refinement al- 
gorithms. While this is one of the most extensive com- 
parisons in the literature, it is far from exhaustive, be- 
cause implementations and sufficient experimental re- 
sults have not been published for some proposed heuris- 



tics. The main purpose is to demonstrate that partic- 
ular combinations of simple coarsening and multi-level 
refinement algorithms are (at least) competitive with 
the best available heuristics, and thus the results of the 
previous sections are indeed practically significant. 

2 Graph Clusterings and Modularity 

2.1 Graph Clusterings. A graph {V, f) consist of a 
finite set V of vertices and a function f : V xV ^ N that 
assigns a nonnegative edge weight to each vertex pair. 
For simplicity, graphs are assumed to be undirected, i.e., 
f{u,v) = f(y,u) for all u,v £V. The degree deg(w) of 
a vertex v is the total weight X)uev ^) edges. 
The degrees and weights are naturally generalized to 
sets of vertices, e.g., f{V,V) = J2ueVvev fi'^^'")- Note 
thatdeg(V^) = /(y,y). 

A graph clustering C — {Ci, . . . , Ck} partitions the 
vertex set V into disjoint non-empty subsets Ci. 

2.2 Modularity. Modularity is a quality measure 
for graph clusterings. It was originally introduced for 
graphs where the edge weights are either or 1 [50] , 
and was later generalized to arbitrary edge weights [5^ . 
The modularity of a clustering C is defined as 
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Intuitively, the first term is the actual fraction of 
intra-cluster edge weight. In itself, it is not a good 
measure of clustering quality, because it takes the 
maximum value 1 for the trivial clustering where one 
cluster contains all vertices. The second term specifies 
the expected fraction of intra-cluster edge weight in a 
null model where the end- vertices of | deg{V) edges are 
chosen at random, and the probability that an end- 
vertex of an edge attaches to a particular vertex v is 
deg(v) ES]' -'^^ ^^^^ model, the edge weight f{u,v) 
between each vertex pair (u, v) € is binomially 
distributed with the expected value '^^^^^^^y^''^^ ■ 

It can be easily verified that merging two clusters 
C and D increases the modularity by 



AQ 
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and moving a vertex v from its current cluster C to 
another cluster D increases the modularity by 
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3 Coarsening Algorithms 

Greedy coarsening algorithms iteratively merge either 
one cluster pair, as detailed in the first subsection, or 
several disjoint cluster pairs, as detailed in the second 
subsection, and choose the merged cluster pairs accord- 
ing to certain priority criteria, which are discussed in 
the third subsection. 

3.1 Single-Step Greedy. The Single-Step Greedy 
algorithm starts with single-vertex clusters, and iter- 
atively merges the cluster pair with the highest priority, 
until this merge would decrease the modularity. 

Algorithm: Single-Step Greedy Coarsening 

Input: graph, merge prioritizer 
Output: clustering 

initialize clustering with singleton clusters; 
while prioritized pair {C,D) satisfies AQc\d > do 
|_ merge clusters C and D; 

Implementation and Runtime. For the calcula- 



tion of the priorities (see Section 3.3 1 it is necessary to 
quickly retrieve the total edge weights between adjacent 
clusters. These total weights change locally with each 
merge and are thus stored in a dynamically coarsened 
graph where each cluster is represented by a single ver- 
tex. In each merge of two vertices u and v, the edge list 
of the vertex with fewer edges (say u) is merged into the 
edge list of the other vertex. Using the sorted doublc- 
hnked edge lists proposed by Wakita and Tsurumi ^ , 
this requires linear time in the list lengths. However, if 
some neighbor vertices of u are not neighbors of v, then 
one end-vertex of the edges to these neighbors changes 
from utov, and the position of these edges in the neigh- 
bors' edge lists must be corrected to retain the sorting. 

Let n be the number of clusters (initially the vertex 
count), m be the number of adjacent cluster pairs (edge 
count), and d be the height of the merge tree. Merging 
the edge lists of two clusters has linear runtime in the 
list lengths, and each edge participates in at most d 
such merges. Thus the worst-case runtime is 0{dm) 
for the merges and, given that the length of each edge 
list is at most n, 0{dmn) for the position corrections. 
(The implementation of Clauset et al. [7] has better 
worst-case bounds, but experimental results in Section [5] 
indicate that it is not more efficient in practice.) 

In order to quickly find the prioritized cluster pair 
for the next merge, a priority queue (max-heap) over 
the clusters and their current best partner is used. It 
is updated as described in [3H]. In worst case the 
priority queue is updated with each merged edge, taking 
O ( dm log n ) runtime . 
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3.2 Multi-Step Greedy. To prevent extremely un- 
balanced cluster growth, Schuetz and Caflisch intro- 
duced Multi-Step Greedy coarsening, which iteratively 
merges the / disjoint cluster pairs with the highest pri- 
ority (unless the merge decreases the modularity) [55] , 
Single-Step Greedy coarsening corresponds to the spe- 
cial case oi I — 1 (at least conceptually, the implemen- 
tation differs) . To make the parameter I independent of 
the graph size, we specify it as percentage of the num- 
ber of modularity-increasing cluster pairs, and call it 
merge fraction^ The impact of the merge fraction on 
the effectiveness of Multi-Step Greedy coarsening will 
be examined experimentally in Section |5.2[ 



Algorithm: Multi-Step Greedy Coarsening 

Input: graph, merge prioritizer, merge fraction 
Output: clustering 

initialize clustering with singleton clusters; 
while 3 cluster pair (C, D) : AQc.d > do 
/ ^ merge fraction x |{(C, £>) : AQc^d>0}\', 
mark all clusters as unmerged; 
for most prioritized pairs (C, D) do 

if C and D are marked as unmerged then 
merge clusters C and D; 
mark clusters C and D as merged; 



Implementation and Runtime. The same basic 
data structures as in Single-Step Greedy are used. 
To iterate over the I cluster pairs in priority order, 
the edges are sorted once before entering the inner 
loop. This requires O(mlogm) time in worst case. 
Alternative implementations optimized for very small 
merge fractions could use partial sorting with 0{m log I) 
(but a larger constant factor). With merge fraction a 
the inner loop is repeated at least n/a times. However, 
if only few disjoint cluster pairs exist, as in some power- 
law graphs, up to n iterations may be necessary. 

3.3 Merge Prioritizers. A merge prioritizer assigns 
to each cluster pair (C,D) a real number called merge 
priority, and thereby determines the order in which the 
coarsening algorithms merge cluster pairs. Because the 
coarsening algorithms use only the order of the prior- 
ities, two prioritizers can be considered as equivalent 
if one can be transformed into the other by adding a 
constant or multiplying with a positive constant. 

^Recently, Schuetz and Caflisch provided the empirical formula 
lopt '■= «^ /(V, V) for good values of I |37l . It does not outperform 
our formula for unweighted graphs (see Section |6]l, and is unsuit- 
able for weighted graphs, because scaling all edge weights with a 
positive constant changes lopt but not the optimal clustering. 



The Modularity Increase (MI) AQc.d resulting 
from the merge of the clusters C and D is an obvious 
and widely used merge prioritizer [571 13 [Ml HI] • 

The Weight Density (WD) is defined as 



f{C,D) 



and is equivalent to 



AQc. 



Its 
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use as merge prioritizer has not yet been proposed in 
the literature, although Newman and Girvan originally 
introduced the modularity measure to formalize the 
requirement of intra-cluster density and inter-cluster 
sparsity [30^, and Reichardt and Bornholdt showed 
that clusterings with optimal modularity indeed fulfill 
this requirement 33J. 

The Significance (Sig), another new merge priori- 



tizer, is defined as 



AQcD 



and is thus a natural 



7dog(C)deg(D)' 

compromise between Modularity Increase and Weight 
Density. A further motivation is its relation to the 
(im)probability of the edge weight /(C, D) in the null 
model described in Section |2.2[ Under this null model, 
both the expected value and the variance (at least for 
large deg{V)) of the edge weight between C and D are 
'^°^deg(y^^^'' , and the Significance is equivalent to the 
number of standard deviations that separate the actual 
edge weight from the expected edge weight. 

Danon et al. (DA) observed that the Modularity 
Increase AQc,d tends to prioritize pairs of clusters 
with large degrees, and proposed the merge prioritizer 
to avoid this bias [9 . It equals 



min(dcg(C),dog(D)) 

the Significance if deg(C) = deg(D), and is another 
compromise between Modularity Increase and Weight 
Density. 

Wakita and Tsurumi found that greedy coarsen- 
ing by Modularity Increase tends to merge clusters of 
extremely uneven sizes [38]. In order to suppress un- 
balanced merges, they proposed the merge prioritizer 
™™ ( sizc(j) ' size(c) )^Qg,-P' where size(C) is either the 
number of vertices in C (prioritizer HN) or the number 
of other clusters to which C is connected by an edge of 
positive weight (prioritizer HI^). 

Other types of merge prioritizers are clearly pos- 
sible. For example, vertex distances from random 
walks or eigenvectors of certain matrices have been suc- 
cessfully applied in several clustering algorithms (e.g., 
[SU [551 [TT]) . However, preliminary experiments suggest 
that these relatively complicated and computationally 
expensive prioritizers may not be more effective than 
the simple prioritizers in this section |34j . 

4 Refinement Algorithms 

Refinement algorithms perform a local search by iter- 
atively moving individual vertices to different clusters 
(including newly created clusters) such that the modu- 
larity increases. 
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The first three subsections describe simple variants 
of greedy refinement, and the final subsection proposes, 
for the first time in modularity clustering, to apply re- 
finement on more than one level of the coarsening hier- 
archy. Excluded from consideration are algorithms with 
several tunable parameters or explicit randomness, like 
simulated annealing [33l EH [23] or extremal optimiza- 
tion [H]. 

4.1 Complete Greedy. Complete Greedy refine- 
ment repeatedly performs the best vertex move, until 
no further modularity-increasing vertex moves are pos- 
sible. Here the best vertex move is a move with the 
largest modularity increase AQy^D over all vertices v 
and all target clusters D. 

Algorithm: Complete Greedy Refinement 

Input: graph, clustering 
Output: clustering 

repeat 

{v, D) «— best vertex move; 

if AQy^D > then 
|_ move vertex v to cluster D; 

until AQy^D < ; 

Implementation and Runtime. Vertices are 
moved in constant time using a vector mapping ver- 
tices to their current cluster. To find the best move, 
the modularity changes AQy^/j for all vertices v and 
clusters D adjacent to v (and a newly created cluster) 
need to be determined. For this purpose the algorithm 
iterates over the vertices. For each vertex v the summed 
weights f{v, D) are collected in one pass over its edges 
by using a search tree similar to [1]. Given f(v,D), the 
modularity change AQy^u can be computed in con- 



stant time (see Section 2.2 1. Therefore, to find the glob- 



ally best move, all n vertices and m edges are visited 
once, and the weight of each edge is added in a search 
tree of at most n entries. Assuming 0{n) moves yields 
a worst-case runtime of 0{nmlogn). 

4.2 Fast Greedy. Fast Greedy refinement repeat- 
edly iterates through all vertices and moves each ver- 
tex to its best cluster, until no improvement is found 
for any vertex. Finding the best move for a particular 
vertex is considerable cheaper than finding the globally 
best vertex move, as in Complete Greedy refinement; 
the question whether this improved efficiency comes at 
the cost of worse effectiveness will be addressed by an 
experiment in Section |5.3[ Fast Greedy refinement has 
been previously proposed by Schuetz and Caflisch [36' 
and Ye et al. ^41J . 



Algorithm: Fast Greedy Refinement 

Input: graph, clustering 
Output: clustering 

repeat 

foreach vertex v do 

D ^ best cluster for v; 

if AQy^D > then 
|_ move vertex v to cluster D; 

until no improved clustering found ; 

Implementation and Runtime. The implemen- 
tation is very similar to the previous algorithm. The 
worst-case time for one run of the inner loop is 
©(mlogn), and a few runs usually suffice. 

The order in which the inner loop visits the vertices 
seems to have little impact on the obtained modularity 
in practice; in our implementation, vertices are sorted 
by increasing number of edges. 

4.3 Adapted Kernighan-Lin. Kernighan-Lin re- 
finement extends Complete Greedy refinement with a 
basic capability to escape local maxima. The algorithm 
was originally proposed by Kernighan and Lin for mini- 
mum cut partitioning [20] , and was adapted to modular- 
ity clustering by Newman |29| (though with a limitation 
to two clusters). In its inner loop, the algorithm itera- 
tively performs the best vertex move, with the restric- 
tion that each vertex is moved only once, but without 
the restriction that each move must increase the mod- 
ularity. After all vertices have been moved, the inner 
loop is restarted from the best found clustering. Pre- 
liminary experiments indicated that it is much more ef- 
ficient and rarely less effective to abort the inner loop 
when the best found clustering has not improved in the 
last k := 101og2 \V\ vertex moves [34] . 

Algorithm: Adapted Kernighan-Lin Refinement 

Input: graph, clustering 
Output: clustering 

repeat 

peak ^ clustering; 

mark all vertices as unmoved; 

while unmoved vertices exist do 

(u, D) ^ best move with v unmoved; 
move V to cluster D, mark v as moved; 
if (5(clustering) > Q(peak) then 
\_ peak ^ clustering; 

if k moves since last peak then 
|_ break ; 

clustering ^ peak; 

until no improved clustering found ; 
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Implementation and Runtime. The implemen- 
tation is largely straightforward; to improve efficiency, 
the current clustering is not copied unless the modu- 
larity begins to decrease. The worst case runtime is 
the same as for Complete Greedy, assuming that a few 
outer iterations suffice. In practice, Kernighan-Lin re- 
finement takes somewhat longer, because it also per- 
forms modularity-decreasing moves. 

4.4 Multi-Level Refinement. The refinement al- 
gorithms in the previous subsections easily get stuck in 
suboptimal clusterings because they only move individ- 
ual vertices. Even Kernighan-Lin refinement is unlikely 
to move a medium-sized group of densely interconnected 
vertices to another cluster, because this would require 
a series of sharply modularity-decreasing vertex moves. 
However, the vertex group may well have been merged 
into a single cluster at some stage of the coarsening, and 
a refinement algorithm can easily reassign the group if it 
moves entire clusters of this coarsening level, instead of 
individual vertices. This is the basic idea of multi-level 
refinement, which has already proved to be very effec- 
tive for minimum cut partitioning problems |18|, I19j. 

The Multi-Level Clustering algorithm first executes 
a coarsening algorithm (for example, any algorithm 
from Section |3]) and then, usually several times, a 
refinement algorithm (for example, any algorithm from 
the previous subsections). Intermediate results of the 
coarsening algorithm are recorded as coarsening levels 
whenever the number of clusters has decreased by a 
certain percentage, which is provided as a parameter 
called reduction factor. Each coarsening level is a graph 
whose vertices are the clusters at the respective state 
of coarsening. The refinement algorithm is applied to 
every coarsening level, from the coarsest level to the 
original graph. At the coarsest level, each vertex belongs 
to a separate cluster, and at the finer levels, the initial 
cluster membership of each vertex is copied from the 
corresponding vertex of the previous (coarser) level. 

The conventional Single-Level refinement, which 
executes a refinement algorithm only on the original 
graph, is the special case of Multi-Level refinement with 
a reduction factor of 100%. While Multi-Level refine- 
ment, and more generally every decrease of the reduc- 
tion factor, potentially produces better clusterings, it 
may be suspected to significantly increase the required 
runtime. However, this is not necessarily the case, be- 
cause the additional coarsening levels are smaller than 
the original graph, and one cheap vertex move on a 
coarse graph can save many expensive vertex moves on a 
finer graph. The impact of the reduction factor on both 
effectiveness and efficiency is examined experimentally 
in Section [Ol 



Algorithm: Multi-Level Clustering 

Input: graph, coarsener, refiner, reduction factor 
Output: clustering 

// coarsening phase 

level[l] ^ graph; 

for I from i to . . . do 

level -1-1] <— coarsener (level [/], reduction factor); 

if no clusters merged then break ; 

// refinement phase 
clustering ^ vertices of level [/max]; 
for / from Z^ax — I to 1 do 

project clustering from level[Z + l] to level[Z]; 

clustering ^refiner (level[Z], clustering) ; 



Related Work. Several recent algorithms for 
modularity clustering are related to Multi-Level refine- 
ment, but differ in crucial respects. Djidjev's method is 
(despite its name) not itself a multi- level algorithm, but 
a divisive method built on an existing multi-level algo- 
rithm for minimum cut partitioning lOj. Blondel et al. 
use local search on multiple levels to coarsen graphs, but 
do not refine the results of the coarsening Ye et al.'s 
algorithm performs refinement on multiple coarsening 
levels, but only moves vertices of the original graph in- 
stead of coarse vertices (clusters) |?T]. 

Implementation and Runtime. With reduction 
factor a at most log]^/(x-Q)("') coarsening levels are 
generated. For each new level a graph homomorphism 
connecting it to the previous level is constructed and 
used to transfer weights and clusterings between levels. 
To decouple the Multi-Level Clustering algorithm from 
details of the coarsening algorithm, the coarse graph and 
its homomorphism is constructed from the clustering 
produced by the coarsening algorithm: The vertices 
of each cluster are connected to their cluster-vertex in 
0{n) time. For each edge the corresponding cluster- 
edge has to be found or added if not yet existing. 
This search is accelerated by processing all vertices of 
a cluster successively and using a search tree over the 
cluster-edges of the current cluster. Thus constructing 
and connecting all edges costs C'(mlogn) time. 

5 Experiments 

This section experimentally compares the effectiveness 
(achieved modularity) and efficiency (runtime) of the 
various heuristics presented in the previous sections. 

5.1 Experimental Setup. The heuristics were im- 
plemented in C-|— I- and compiled with GCC 4.2.3. The 
implementations are available online at http://www.| 
|inf ormat ik . tu- cottbus . de/'^rrotta7| 
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Figure 1: Modularity by merge fraction and prioritizer. 



In order to compare the effectiveness of the heuris- 
tics, the arithmetic mean of the modularity over a fixed 
set of graphs is measured; higher means indicate more 
effective algorithms. (Thus only the relative values of 
the means are interpreted, the absolute values are not 
intended to be meaningful.) Generated graphs are not 
used because they are structurally very limited (e.g., 
in their vertex degree distribution), and do not neces- 
sarily permit generalizations to graphs from real appli- 
cations. Instead the graph set contains 58 real-world 
graphs retrieved from various resources as listed in Ap- 
pendix]^ The available graphs were roughly classified 
by their application domain and graphs of diverse size 
that fairly represent all major domains were selected. In 
addition the collection includes commonly used bench- 
mark graphs like Zachary's karate club network [32] • 
The graphs range from a few to 75k vertices and 352k 
edges. 

All runtimes were measured on a S.OOGHz Intel 
Pentium 4 processor with 1GB main memory. The 
time for reading the graph was excluded to avoid that 
it interferes with the aspects studied here. 

5.2 Coarsening Algorithms. Figure [l] compares 
the effectiveness of the merge prioritizers for Single-Step 
Greedy coarsening (represented by a merge fraction 
of 0%) and Multi-Step Greedy coarsening with merge 
fractions of 2%, 5%, 10%, 20%, 50%, and 100%. No 
refinement was used. The runtime measured on the 



Figure 2: Runtime by merge fraction and prioritizer on 
the graph 'DIC28_main'. 



graph 'DIC28_main' is shown in Fig. [2] and is typical 
for larger graphs. 

Concerning the merge prioritizers, Wakita's HE 
and HN are much less effective than the others, and 
not more (usually even less) efficient. The lower effec- 
tiveness is also visible in Fig. |9] for Wakita's original 
implementation. 

Concerning the algorithms, Multi-Step Greedy is 
generally less effective and less efficient than the simpler 
Single-Step Greedy. Only for Modularity Increase, 
Multi-Step Greedy is faster and, for merge fractions of 
2% and 5%, also slightly more effective, but still similar 
to Single-Step Greedy with Danon and Significance. 
Apparently the other merge prioritizers do not benefit 
from Multi-Step Greedy's tendency to balance cluster 
sizes because, unlike Modularity Increase, they have no 
strong bias towards merging large clusters. 

5.3 Refinement Algorithms. Figure [3] compares 
the effectiveness of the refinement algorithms for Single- 
Level refinement (reduction factor 100%) and Multi- 
Level refinement with reduction factors of 5%, 10%, 
20%, and 50%. As coarsener Single-Step Greedy 
with the Significance prioritizer was chosen, because it 
proved to be effective and efficient in the previous sub- 
section. The runtime measurements on 'DIC28_main' 
are shown in Fig. [4] and the dependency of the runtime 
on the graph size is depicted in Fig. [5] 
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Figure 5: Runtime ratio of the refinement heuristics to raw coarsening, log-log scaled. Reduction factor is 50%. 
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Figure 3: Modularity by reduction factor and refine- 
ment method. 
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Figure 4: Runtime by reduction factor and refinement 
method on the graph 'DIC28_main'. 



Multi-Level refinement with a reduction factor 
of 50% turns out to be more effective than Single-Level 
refinement, and similarly efficient. Reduction factors 
below 50% do not considerably improve the modularity, 
but significantly increase the runtime for Fast Greedy. 

Fast Greedy refinement is about as effective as 
Complete Greedy, and just shghtly less effective than 



Kernighan-Lin, but much faster. It scales well with 
the graph size (see Fig. |5|, while Complete Greedy and 
Kernighan-Lin become prohibitively expensive. 

5.4 Combining Coarsening and Refinement. 

Concerning Single-Level vs. Multi-Level refinement, 
Fig.|6]shows that Multi-Level refinement is consistently 
more effective for all merge prioritizers, and thus con- 
firms the results for the Significance prioritizer in Fig.jSj 

Concerning Single-Step vs. Multi-Step Greedy 
coarsening. Fig. [7] shows that for the best merge pri- 
oritizers, both are similarly effective with Multi-Level 
refinement, while Single-Step Greedy is more effective 
without refinement. Clearly, Multi-Level refinement 
benefits from the uniform cluster growth enforced by 
Multi-Step Greedy coarsening. Overall, Single-Step 
Greedy coarsening is still preferable because of its 
greater simplicity and efficiency. 

Concerning the merge prioritizers. Figs. |6] and [7] 
show that Modularity Increase is only competitive with- 
out refinement (ignoring efficiency) , and Weight Density 
is only competitive with Multi-Level refinement. Here 
Multi-Level refinement benefits from the bias of Weight 
Density towards balanced cluster growth, and suffers 
from the bias of Modularity Increase towards unbal- 
anced cluster growth. Danon and Significance are ef- 
fective with and without refinement. 

5.5 Conclusions. The best algorithm found in these 
experiments is Single-Step Greedy coarsening with 
Danon or Significance as merge prioritizer combined 
with Multi-Level Fast Greedy refinement (or Multi- 
Level Kernighan-Lin, if efficiency is no concern). 
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Figure 6: Mean modularity by merge prioritizer. Left 
bars show reduction factor 100% (Single-Level), right 
bars 50% (Multi-Level). Both use Single-Step Greedy. 



Figure 7: Mean modularity by merge prioritizer: Left 
bars show merge fraction 5% (Multi-Step) and right bars 
0% (Single-Step). Both use reduction factor 50%. 



Interestingly, Single-Step Greedy refinement out- 
performed the recent and more complex Multi-Step 
Greedy (for the best merge prioritizers) , the Danon 
and Significance merge prioritizers clearly outperformed 
the much more widely used Modularity Increase (espe- 
cially with refinement, and considering efiiciency) and 
Wakita's prioritizers, and the newly proposed Multi- 
Level refinement consistently outperformed the popular 
Single-Level refinement. 

6 Related Algorithms 

An exhaustive review and comparison of the numerous 
algorithms for modularity clustering is beyond the scope 
of this paper; the purpose of this section is to provide 
evidence that our recommended heuristic - Single-Step 
Greedy coarsening by Significance with Multi-Level Fast 
Greedy refinement (SS+ML) - is competitive with the 
best existing methods. 

6.1 Basic Approaches. Algorithms for modularity 
clustering can be categorized into the following four 
types: Subdivision heuristics try to divide the net- 
work, for example by iteratively removing edges [30] 
or by recursively splitting the graph using eigenvec- 
tors |29j . Coarsening (or agglomeration) heuristics iter- 
atively merge clusters starting from singletons. Cluster 
pairs can be selected based on random walks [M] [32] , 
increase of modularity [71 |36l HI]) or other criteria 
|38l l^[TT]. Local search heuristics move vertices between 
clusters, with Kernighan-Lin-style and greedy search be- 
ing the most prominent examples. Other approaches in- 
clude Tabu Search Extremal Optimization [H], and 
Simulated Annealing [33j [211 [23]. Finally, mathemat- 
ical programming approaches model modularity maxi- 
mization as a linear or quadratic programming prob- 
lem which can be solved with existing software packages 
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Table 1: Best published modularity values for four al- 
gorithm classes, compared to the modularity values for 
our heuristic SS-I-ML. Where possible, missing values 
were substituted with results from published implemen- 
tations (shown in italics). 



6.2 Published Modularity Values. Table [Tj com- 
pares modularity values from various publications with 
the results of our heuristic SS-I-ML. Mathematical pro- 
gramming approaches consistently find better cluster- 
ings than SS-I-ML, though by a very small margin; how- 
ever, they are computationally much more expensive 
and do not scale to large graphs [U [40]. Compared 
to the best algorithms in the three other classes, the 
results of SS-I-ML are very competitive, and for large 
graphs significantly better. 

6.3 Published Implementations. In order to di- 
rectly compare our heuristics with existing algorithms, 
a range of publicly available implementations was re- 
trieved from authors' websites and through the igraph 
library of Csardi and Nepusz |H]. Only a subset of the 
graph collection could be used as some implementations 
cannot process graphs with weighted edges or self-edges. 
The employed 23 graphs range from a few to 75k ver- 
tices and are marked with UW in Appendix |X] In some 
of these graphs negligible differences in edge weights and 
small amounts of self-edges were removed. 
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Figure 10: Runtime of the published implementations on unweighted graphs, log-log scaled. 



□ with refinement 

□ normal 



Figure 8: Mean modularities from four published imple- 
mentations and our (approximate) reimplementations 
on unweighted graphs. 



□ with refinement 

□ normal 



Figure 9: Mean modularities from the published imple- 
mentations and our recommended heuristic SS-Sig+ML 
on unweighted graphs. 



Caflisch in [37]), and the algorithm of Pons and Lat- 
apy [3T] based on short random walks (here of length 4) . 
The examined local search heuristics are simulated an- 
nealing of Reichardt and Bornholdt [33] (here with at 
most 120 clusters) and the recent hierarchical algorithm 
of Blondel et al. [4 . 

Concerning the performance of the (approximately) 
reimplemented heuristics, the mean modularities from 
the published implementations are roughly reproduced 
or slightly improved by our implementations (Fig. [S]) - 
even for Schuetz and Caflisch, where the computation 
of the parameter I differs (see Section 3.2 1. Note that 
refinement is not available in the implementations of 
Wakita and Tsurumi and of Clauset et al., and is 
optional in the implementation of Schuetz and Caflisch. 

Concerning the performance of our recommended 
heuristic, Single-Step Greedy coarsening by Significance 
with Multi-Level Fast Greedy refinement (SS-Sig+ML), 
only Reichardt and Bornholdt 's implementation pro- 
duces clusterings of similarly high modularity, but it 
is much slower, and only Blondel et al.'s implementa- 
tion is faster, but it produces worse clusterings (Figs. [9] 



and 10 1. Even the still simpler and faster variant with 



Single-Level refinement (SS-Sig+SL) produces compet- 
itive clusterings, notably in comparison with the recent 
algorithm of Schuetz and Caflisch which is more com- 



plex and requires parameter tuning (see Section 3.2 1 



The included coarsening heuristics are the fast 
greedy joining of Clauset et al. [7], the algorithms of 
Wakita and Tsurumi [38 , the recent multi-step greedy 
algorithm of Schuetz and Caflisch [36, (with parameter 
/ — 0.25^y f{V, V)/2, as recommended by Schuetz and 



7 Summary and Conclusion 

Various coarsening and refinement heuristics for mod- 
ularity clustering can be organized into a design space 
with four dimensions: merge fraction (including Single- 
Step and Multi-Step Greedy coarsening), merge prior- 
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itizer, refinement algorithm, and reduction factor (in- 
cluding Singlc-Lcvcl and Multi-Level refinement). In an 
experimental comparison of achieved modularities and 
runtimes, some widely used or rather complex design al- 
ternatives for example, Multi-Step Greedy coarsening, 
merge prioritization by Modularity Increase, or Single- 
Level refinement - were outperformed by newly pro- 
posed or simpler alternatives particularly Single-Step 
Greedy coarsening by Significance with Multi-Level Fast 
Greedy refinement. In a comparison with published 
implementations and benchmark results, this heuristic 
was more efficient than algorithms that achieved simi- 
lar modularities, and achieved higher modularities than 
algorithms with similar or better efficiency. 
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A The Benchmark Graph Collection 

Table [2] on the next page lists graphs used for the ex- 
periments. The graphs postfixed with '_main' just con- 
tain the largest connectivity component of the original 
graph. All graphs from the subset 'UW were used with- 
out edge weights and self-edges for the experiments on 
published implementations. For each graph the source 
collection is named in the last column. Web addresses to 
these collections are listed in Table |3] For information 
about the original authors please visit the respective 
websites. 
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Table 2: Graph collection. 



source 



web address 



Arenas 
ANoack 

Cx-Nets http 
GraphDrawing http 
Newman http 



pajek 
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ihttp : //deim. u rv. cat/~ aarenas/data/w elcome .htm 
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//www-personal .umich.edu/~mejii/netdata7r 



http : //vlado .fmf . uni-lj . si /pub/networks /data/ 
http : //www. weizmEinn. ac . il/mcb/UriAlon/ 



Table 3: Graph sources. 
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